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(57) Abstract 

The invention provides a recombinant virus vector for use as an immunotherapeutic or vaccine. The recombinant virus vec- 
tor comprises at least one pair of nucleotide sequences heterologous to the virus and which have sufficient sequence homology 
that recombination between them might be expected. The pair f nucleotide sequences are arranged in the virus vector such that 
they are inverted with respect to each ther. The virus vector is able to infect a mammalian h st cell and express as polypeptide 
the heterologous nucleotide sequences in the h st cell. For infection thought to be caused by HPV infection, the pair f nucleotide 
sequences encode part or all of human papill mavirus (HPV) wild-type proteins or mutant proteins immunol gically cross-reac- 
tive therewith. For an immunotherapeutic or vaccine against cervical cancer, the recombinant virus vect r encodes part r all of 
the HPV wild-type proteins HPV16E7 and HPV18E7 or mutant proteins immunologically cross-reactive therewith.- 
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RECOMBINANT VIRUS VECTORS ENCODING HUMAN PAPILLOMAVIRUS PROTEINS 
This invention relates to recombinant virus vectors. 
In particular, it relates to recombinant virus vectors 
designed to overcome the problem of recombination 
5 between homologous nucleotide sequences. It also 
relates to recombinant virus vectors encoding human 
papil lomavirus proteins ; to immunotherapeutics and 
vaccines for conditions associated with HPV infection; 
to the production of a virus (e.g. vaccinia virus) 

10 engineered to express antigens encoded by human 
papillomavirus types 16 and 18 and to 
immunotherapeutics and vaccines for cervical cancer. 

In recent years, strong evidence has been adduced 
for a link between cervical carcinoma and infection 

15 with certain types of human papillomavirus (HPV), 
particularly with types 16, 18, 31, 33 and 35 (Gissman 
et al., Cancer Cells 5,275, 1987). This is based on 
hybridisation studies which have indicated that more 
than 85-90% of biopsies from cervical tumours can be 

20 shown to contain papillomavirus DNA. HPV 16 DNA is 
most commonly found (in about 60% of tumours) with 
HPV18 the next most frequent (about 20%) and the other 
types accounting for a further 5-10%. In many 
instances, tumour cells from the biopsies do not 

25 however, contain the complete genome, but rather a 
deleted form. The extent and location of the deleted 
information within the virus g nome is variable, but 
a gen ral feature is the retention of the part f the 
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genome encoding -the E7 protein (Schwarz et al., Nature 
314 , 111 , 1985 ) . In addition, the adjacent 
E6 -encoding region is usually present. The ubiquitous 
presence of the E7-encoding region in tumour cells 
5 suggests that the protein product of this gene might 
play a role in the induction or maintenance of the 
transformed phenotype. Indeed in most cell lines 
established from tumour biopsies, expression of the E7 
gene can be detected (Smotkin & We tt stein, PNAS, 83, 
10 4680, 1986). Furthermore, it has been shown that the 
E7 gene product can bind to the retinoblastoma (Rb) 
gene product, a recognised " ant i -oncogene" in normal 
human cells (Munger et al., EMBO J. 8,4099, 1989). 
This strengthens the belief that E7 is directly 
15 involved in cell transformation. 

The presence and expression of the E7 and E6 
genes in tumour cells derived from cervical carcinoma 
biopsies, suggests the possibility that these proteins 
could be potential targets for the immunological 
20 recognition of the tumour cells. It is well known 
that viral proteins produced inside mammalian cells 
can be processed through a host cell pathway to short 
peptides, which then form a complex with host Major 
Histocompatibility Complex (MHC) Class 1 molecules and 
25 are transported to the cell surface. These complexes 
may then present a target for recognition by the host 
immune system. Interaction of the complex with the 
receptor molecule on th surf ac of cytotoxic T cells 
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(the T cell recept r) can then lead to activation of 
the T cells to proliferate r to d stroy the 
recognised cell. It is possible, therefore, that the 
presence in the body of a population of cytotoxic T 
5 lymphocytes (CTLs) which are capable of recognising 
cells expressing the HPV E6 and/or E7 proteins could 
afford protection against the development and 
proliferation of cervical tumours. Indeed it has been 
reported that normally oncogenic mouse cells 
10 engineered to express the HPV E7 protein are unable to 
form tumours in mice which have been previously 
immunised with non-tumorigenic E7-expressing cells, 
and that this rejection is mediated by CD 8+ 
lymphocytes (CTLs) (Chen et al., PNAS 88, 110, 1991). 
1 5 Further , the generation of an active population of 
such cells subsequent to tumour initiation could 
result in regression of the tumour. 

There are numerous reports on the construction of 
recombinant viruses eg vaccinia viruses containing, 
20 and expressing foreign genes (Mackett & Smith, J. gen. 
Virol. 67,2067, 1986), and several reports of the use 
of these recombinant viruses to generate effective 
immune responses against the expressed foreign 
antigens. A particular advantage of this route for 
25 delivery of antigens for vaccination is that it may 
lead to the development of cellular as well as humoral 
immunity. This is becaus the foreign proteins will b 
produced Inside cells of the infected individual in a 
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manner similar to that which occurs during natural 
infection. This means that they should be proc ssed 
through the correct pathway to allow generation of a 
CTL response. In several cases , it has been 
5 demonstrated directly that immunisation with the 
recombinant virus is capable of producing a cellular 
Immune response in the form of foreign 
antigen-specific CTLs (Moss & Flexner, Ann. Rev. 
Immunol., 5,305, 1987). Furthermore, vaccination of 

10 animals with recombinant vaccinia viruses expressing 
certain tumour-specific antigens, such as the human 
melanoma-associated antigen P97 (Estin et al., PNAS, 
85, 1052, 1988), the bovine papillomavirus E7 protein 
(Meneguzzi et al., Vaccine, 8, 199, 1990) and the 

15 human breast cancer-associated antigen ETA (Hareuveni 
et al., PNAS, 87, 9498, 1990) has been demonstrated to 
result in the induction of immunity against tumour 
initiation and progression. 

The present applicants have recognised the 

20 desirability of producing a recombinant virus vector 
which is useful as an immunotherapeutic or vaccine for 
conditions caused by HFV infection, for example for 
cervical cancer. With respect to cervical cancer, the 
art at the time of the applicants making the present ♦ 

25 invention recognised the E7 gene as having the 

» 

potential to immortalise cells. Therefore, it would 
be felt inappropriate to incorporate the E7 gene in an 
iramunoth rapeutic. The applicants however, have 
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recognized th surprising us fulness of including the 
E7 gen in an iramuno therapeutic. Th y have also 
recognized that the beneficial effects to be gained by 
treatment with an immunotherapeutic comprising the E7 
5 gene are likely to outweigh by far any risk associated 
with the oncogenic activity of the E7 gene. Thus, an 
aspect of the applicants invention involves the use of 
a recombinant virus vector which expresses an E7 gene, 
as an immunotherapeutic or vaccine. Furthermore, the 

10 applicants provide embodiments of their invention in 
which these risks are reduced still further by 
specific alteration of the gene sequences in order to 
reduce the oncogenic potential of the E7 gene without 
compromising its ability to stimulate an appropriate 

15 immune response. 

The present applicants have also recognized that 
where a number of HPV proteins which may be encoded by 
different HPV strains are implicated as being 
associated with a particular HPV- associated condition 

20 (for example, cervical carcinoma, HP VI 6 nd HPV18; 
genital warts, condyloma acuminata, respiratory 
papillomatosis, HPV6 and HPV11; squamous cell 
carcinoma in immunosuppressed individuals, HFV5 and 
, HPV8), rather than produce a plurality of recombinant 

25 viruses engineered separately to express each of the 
implicated proteins, it would be advantageous to 
produc a single virus recombinant which is able to 
express part or all of the sequences of more than one 
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of the proteins. Thus, with rspct to cervical 
cancer, rather than produce four recombinant viruses 
engineered separately to express each of the potential : 
targets for immunological recognition of cervical 
5 tumour cells i.e. the HPV16 E6, HPV16 E7, HPV18 E6 and 
HPV18 E7 proteins, it would be especially advantageous 
to produce a single virus recombinant which is able to 
express the part or all of the sequences of more than 
one of the proteins, preferably at least two of the 

10 proteins and most preferably all four proteins. That 
the present applicants are able to achieve this is 
particularly surprising. This is because, the coding 
sequences for many HPV proteins are highly homologous 
to other equivalent HPV proteins (for example from 

15 other virus strains). Thus, the HPV16 E6 and HPV18 E6 
proteins show overall homology of 62% and comprise 
regions of very high homology. The same is true for 
HPV16 E7 and HPV18 E7 which show overall homology of 
57%, with particular regions of very high homology. 

20 This means that one would expect recombination to 
create problems such as loss of gene sequences. The 
applicants have, however, devised a novel strategy 
designed to minimise the likelihood of such 
recombination events and to circumvent the deleterious » 

25 effect of those events should they indeed arise. 
Thus, surprisingly, the invention provides recombinant 
virus vectors which comprise at least one pair of 
nucleotide sequences which have sufficient sequence 
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h mology that recombination between them might be 
expected. The at least one pair of nucleotide 
sequences may encode part or all of human 
papillomavirus (HPV) wild- type proteins or mutant 
5 proteins immunologically cross -reactive therewith. In 
particular , the invention provides a recombinant 
vector which can maintain stably, and express, part or 
all of four of the desired gene sequences from HPV16 
and HPV18. 

10 Thus, the present invention provides a 

recombinant virus vector for use as an 
immunotherapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
virus and which have sufficient sequence homology that 

15 recombination between them might be expected wherein 
said pair of nucleotide sequences are arranged in said 
virus vector such that they are inverted with respect 
to each other to reduce the likelihood of 
recombination events leading to loss of part or all of 

20 said sequence and said virus vector is able to infect 
a mammalian host cell and express as polypeptide the 
heterologous nucleotide sequences in said host cell. 
The at least one pair of nucleotide sequences may 
encode part or all of human papillomavirus (HPV) wild- 

25 type proteins or mutant proteins immunologically 
cross-reactive therewith. The pair of nucleotide 
sequences may encode part or all of the protein E7 
from both HPV16 and HPV18 or functional equivalents 
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ther of. The pair of nucleotide sequences may encode 
part or all of the proteins E6 from both HPV16 and 
HFV18 or functional equivalents thereof. 

The recombinant virus vector may comprise a 
5 further pair of nucleotide sequences heterologous to 
said virus and which (i) have sufficient sequence 
homology that recombination between them might be 
expected wherein said further pair of nucleotide 
sequences are arranged in said virus vector such that 
10 they are inverted with respect to each other and said 
virus vector is able to infect a mammalian host cell 
and express as polypeptide the further pair of 
heterologous nucleotide sequences in said host cell. 
The further pair of nucleotide sequences may 
15 encode part or all of HPV wild-type proteins or mutant 
proteins immunologically cross-reactive therewith. 

For example, the present invention also provides 
a recombinant virus vector which in addition to the E7 
coflfng .sequences, also comprises and is adapted to 
20 express genetic sequences encoding part or all of the 
protein E6 from both HFV16 and HP VI 8 or functional 
equivalents thereof. The genetic sequences may 
comprise sequences encoding HPV 16 E6/E7 and HPV18 
E6/E7 as shown in figures 1(a) and 1(b) 
25 respectively . 

The genetic sequences may encode an antigenic 
moiety of the said proteins. 

Either or both of the nucleotide sequences in a 
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pair of nucleotide sequences may be altered t make 
them less homologous than an equivalent pair of 
nucleotide sequences encoding wild- type HFV proteins. 
The alteration in nucleotide sequence may be in an 
5 area of high sequence homology . Preferably, the 
alteration in nucleotide sequence will not result in 
an alteration of the encoded amino acid sequence. 

Two or more nucleotide sequences each encoding 
separate proteins may be fused together to form a 
10 single open reading frame. Thus the genetic sequences 
encoding part or all of the proteins E6 and E7 from 
HPV16, may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the proteins E6 and E7 from HPV18, may be fused 
15 together to form a single open reading frame. The 
genetic sequences encoding part or all of the proteins 
E6 and E7 from both HPV16 and HPV18, may be fused 
together to form a single open reading frame. Thus, 
the recombinant virus vector may have the pairs of 
20 nucleotide sequences arranged according to any one of 
the options shown in Figure 26. Where the recombinant 
virus vector comprises an open reading frame having a 
fused genetic sequence encoding part or all of the 
proteins E6 and E7 from HPV16, and a separate open 
25 reading frame having fused genetic sequences encoding 
part or all of the proteins E6 and E7 from HP VI 8, the 
two open reading frames may be inverted with respect 
to on another. For example, the two open reading 
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frames may be arranged in the recombinant virus vector 
adjacent to each other. The inversion may be such 
that the E6 coding sequences of HPV16 and HFV18 are 
both located between the E7 coding sequences of HPV16 
5 and HPV18. Alternatively, the inversion could be such 
that the E7 coding sequences of HPV16 and HPV18 are 
both located between the E6 coding sequences of HPV16 
and HPV18. In particular the two open reading frames, 
each with its respective promoter, may be arranged 
10 next to each other in the recombinant vector. In this 
case the promoters may be located between the genes, 
which are transcribed outwardly, or the promoters may 
be located outside the genes, which are transcribed 
inwardly. 

15 Similarly, the genetic sequences encoding part or 

all of the E7 protein from HPV16 and the E7 protein 
from HPV18 may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the E6 protein from HPV16 and the E6 protein 

20 from HPV18 may be fused together to form a single open 
reading frame . This leads to another range of 
arrangements similar to those shown in Figure 26. The 
fusions may be via a single codon encoding a 
relatively small neutral amino acid e.g. glycine. 

25 Thus the present invention also provides a 

recombinant virus vector which comprises a first open 
reading frame having a fused genetic sequence encoding 
part or all of the wild-type proteins E6 and E7 from 
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HPV16; and a separat second pen reading frame having 
a fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; wherein the 
first and second open reading frames may be inverted 
5 with respect to one another whereby either: 1) the E6 
coding sequences of HPV16 and HPV18 are both located 
between the E7 coding sequences of HPV16 and HP VI 8; or 
ii) the E7 coding sequences of HPV16 and HPV18 are 
both located between the E6 coding sequences of HFV16 
10 and HPV18; and wherein any of said wild- type proteins 
may be replaced by a mutant protein immunologically 
cross-reactive therewith. 

Each of the first and second open reading frames 
may have a corresponding promoter and the two open 
15 reading frames each with its promoter, are arranged 
next to each other in the virus. 

The present invention also provides a recombinant 
virus vector wherein either: i) the promoters are 
located between the first and second reading frames 
20 whereby the open reading frames are transcribed 
outwardly; or ii) the promoters are located outside 
the first and second open reading frames whereby the 
open reading frames are transcribed inwardly. 

The present invention also provides a recombinant 
25 virus vector which comprises a first open reading 
frame having a fused genetic sequence encoding part or 
all of the wild- type proteins E6 and E7 from HPV16; 
and a separate second open reading frame having a 
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fused genetic sequence encoding part: or all of the 
wild- type proteins E6 and E7 from HPV18; wherein the 
E6 coding sequences of HPV16 and HPV18 are both 
located between the E7 coding sequences of HPV16 and 
5 HPV18; and each open reading frame has a corresponding 
promoter , the promoters being located between the 
first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and wherein 
any of said wild-type proteins may be replaced by a 
10 mutant protein immunologically cross -reactive 
therewith. 

The wild-type proteins HPV16E7 and HPV18E7 may be 
replaced with mutant proteins which are substantially 
homologous to said wild-type proteins and in which the 
15 residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine 
residues. 

The recombinant virus vector may be derivable 

20 from vaccinia virus. 

The applicants have also recognized that for 
effective function as an immunotherapeutic, it is 
desirable for the recombinant virus to retain its 
ability to replicate and thereby generate an active 

25 infection in order that a cellular immune response may 
be mounted against the virus-encoded proteins. Thus, 
the applicants propose that the foreign gene sequences 
should be inserted into the vector virus at sites, the 
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disruption of which by the insertion of the 
heter log us gene sequences will not substantially 
interfere with, and therefore have a substantially 
adverse affect on any viral functions which relate to 
5 the replicative ability of the virus in the infected 
host animal . The applicants have named these sites 
'neutral sites' (although the term 'neutral 9 should 
not be interpreted strictly as it is acknowledged that 
the disruption of these sites may have a small, but 
10 relatively speaking inconsequential adverse affect on 
replicative ability). 

DNA sequences which affect virus replication can 
fall into several categories: 
i) protein coding sequences; 

15 ii) elements involved in control of gene expression; 
and iii) elements involved in virus DNA replication 
A non-essential and neutral insertion site must 
therefore avoid such regions, and, such sites have 
been identified on 
20 the basis of nucleotide sequencing studies. Thus the 
genetic sequences may be inserted into neutral sites 
within the virus genome . One or more genetic 
sequences may be inserted into the same neutral site. 

Neutral sites can be easily tested for according 
25 to techniques well known in the art. For example, a 
site may be s lected, interrupt d or deleted using 
standard methodologies and the resultant recombinant 
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virus placed in conditions which normally support 
growth of the wild type virus vector, t assess the 
effect of the manipulations. The pathogenicity of the 
virus may be further compared with that of the 
5 unmodified virus vector strain in animal models, in 
order to assess its level of attenuation. 

In the present invention, the virus vector may be 
vaccinia virus. The vaccinia virus may be attenuated 
or disabled so that it is unable to fully replicate 
10 and esablish an extensive infection of host cells* 

Vaccinia virus has been used extensively in the 
past for vaccination against smallpox, and its use 
worldwide has led to the complete eradication of the 
disease (Bhebehami, Microbiol. Rev., 47, 455, 1983). 
15 During the World Health Organisation (WHO) campaign to 
eradicate smallpox, several different strains of 
vaccinia virus were used as vaccines . In 1984 a 
meeting was sponsored by the WHO to discuss the use of 
vaccinia virus as live virus vectors (Bulletin of the 
20 WHO 63(3): 471-477). The data in this report indicates 
that the number of complications associated with 
vaccination was lowest for the Wyeth strain of 
vaccinia virus, and so this strain has been chosen as 
a basis for the construction of the recombinant virus 
25 according to an embodiment of the present invention. 

It is well known that insertion of foreign DNA 
into the genome of vaccinia virus at certain favoured 
sites, such as the thymidine kinase gene locus, can 
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reduce dramatically th ability of the virus to 
replicate in vivo. As discussed above, the aim of the 
therapeutic approach described here is to generate an 
active in vivo Infection, so that a cellular immune 
5 response may be mounted against the virus encoded 
proteins. The present invention provides a method for 
inserting foreign genes at neutral sites within the 
genome of a virus, the disruption of which sites by 
the insertion will not interfere with and therefore 
10 have a substantially adverse affect on virus 
replication. 

Where the virus is vaccinia virus, the neutral 
site may be identified herein within the Wyeth strain 
of vaccinia virus on the basis of the related WR 

15 strain nucleotide sequence. Alternatively, where 
other vaccinia virus strains are used, sites 
equivalent to those sites identified above may be 
used . The neutral sites may be any as identified 
hereinafter as A,B,C and D, or a functional 

20 equivalent. 

For successful expression of foreign proteins by 
the recombinant virus vector the foreign genes must be 
placed under the control of a promoter sequence which 
is operable by the virus. Thus the recombinant virus 

25 vector may comprise a single promoter which controls 
the expression of all the heterologous genetic 
sequences within a single open reading frame. 
Alternatively , wher the recombinant virus vector 
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encodes more than one open reading frame containing 
heterol g us genetic sequences, the virus may c mprlse 
a first promoter which controls the expression of the 
genetic sequences from a first open reading frame, and 
5 one or more further promoters which control the 
expression of the genetic sequences from one or more 
further open reading frames. The promoter sequence 
may be virus- specific and several have been 
characterised so far (Davison & Moss, J. Mol. Biol,, 
10 210, 749, 1989; Davison & Moss, J. Mol. Biol., 210, 
771, 1989). The single promoter and the first and one 
or more further promoters may be the p 7.5 promoter. 
There have been reports that the induction of foreign 
antigen-specific CTLs requires expression of the 
15 antigen early in the virus replication cycle (Coupar 
et al., Eur. J. Immunol., 16, 1479, 1986). Therefore, 
a recombinant virus as provided by the present 
invention may involve the use of the p7.5 promoter 
(Venkatesan et al.. Cell, 125, 805, 1981) and/or the 
20 H6 promoter (Rosel et al., J. Virol, 60, 436, 1988), 
both of which are active both early and late in 
infection. 

As mentioned earlier, it has been reported that 
the E7 gene on its own has the potential to 
25 immortalise cells (Phelps et al., Cell 53, 539, 1988). 
In an embodiment of the present invention, the 
strat gy for expression of the protein involves 
production of E7 as a fusion protein with E6, which is 
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unlikely to retain biological function. Embodiments of 
the Invention provide for reducing this risk still 
further, by making changes within the E7 gene which 
are known to destroy Its oncogenic capacity (Cheaters 
5 et al., J. Gen Virol. 71, 449. 1990). Thus in the 
recombinant virus vectors of the present invention, 
the genetic sequences encoding part or all of the E7 
proteins may be altered from the equivalent wild type 
sequences, in order to render the sequences, used In 

10 the recombinant virus vectors less oncogenic than 
their equivalent wild type sequences. 

The present invention also provides 
pharmaceuticals comprising recombinant virus vectors 
as herein defined. The pharmaceutical may be for use 

1 5 against a condition caused by HPV infection which 
comprises an immunotherapeutically effective amount of 
a recombinant virus vector. The pharmaceutical may be 
for use against cervical cancer. 

The pharmaceutical may be a vaccine to immunise 

20 against a condition caused by HPV infection which 
comprises an amount of recombinant virus vector as 
herein provided which when administered to a recipient 
can specifically activate cells of the immune system 
to HPV proteins. The vaccine may be for immunisation 

25 against cervical cancer. 

The pharmaceuticals may comprise one or more 
excipients. Th present invention also provides 
methods of using the recombinant virus vectors as 
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herein defined to make medicaments for use as 
immunotherapeutics or vaccines against conditi ns 
thought to be caused by HPV infection. For example 
for the prophylaxis and treatment of cervical cancer. 
5 The present invention also provides methods of 

treating mammalian patients with recombinant virus 
vectors and pharmaceuticals as herein provided. 

The present invention also provides a method of 
determining a neutral site in a virus vector, the 

10 disruption of which by the insertion of heterologous 
gene sequences will not interfere with, and therefore, 
have a substantially adverse affect on viral function 
which relates to the replicative ability of the virus. 
The method for this determination comprises: (a) 

15 analysing a viral genome to identify open reading 
frames which are likely to encode functional genes, 
by looking for expected codon usage between spaced 
apart start and stop codons; and (b) selecting sites 
which are not in such open reading frames, likely to 

20 encode functional genes, as identified in (a) This may 
include selecting sites between open reading frames 
for sequences of functional genes and selecting sites 
which are in open reading frames which have some 
functional gene characteristics, such as an expected 

25 codon usage, but have lost other essential 
characteristics such as a start codon. The method may 
also comprise interrupting or deleting the selected 
sites from the viral genome and placing the resultant 



WO 92/16636 



PCT/GB92/00424 



19 

virus in conditions which n rmally support growth 'of 
the wild type virus. 

The present invention also provides neutral sites 
identified by use of the above methods. 
5 The present invention provides an embodiment 

which shows a way of inducing a cellular immune 
response against the papillomavirus proteins usually 
expressed in cervical tumour cells by the creation of 
a recombinant vaccinia virus, which has been 

10 engineered to produce the HPV E6 and E7 proteins > or 
proteins containing HPV E6 and E7 sequences, during 
its replication cycle. This therapeutic vaccinia 
virus contains the E6 and E7 genes from both HPV16 and 
HPV 18 , the viruses most commonly associated with 

15 cervical carcinoma . Vaccination with this single 
virus may thus stimulate immunity to the E6 and E7 
proteins of the HPV types associated with more than 
80% of cervical tumours. Expression of all four gene 
sequences (e.g. HPV16 E6 and E7; HP VI 8 E6 and E7) in 

20 a single virus however presents a problem, because of 
the likelihood of loss of genetic sequences through 
recombination • The present invention provides a 
method for circumventing this difficulty, firstly 
through specific sequence alteration, in order to 

25 reduce sequence homology and secondly through their 
insertion into the vaccinia virus genome in such a way 
that if such recombination were to occur, it would not 
lead to loss of s quences (i.e. in inverted 
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orientation with respect to each other). Expression 
of the desired f ur gene sequences in the vaccinia 
virus genome could also be difficult (though not 
impossible ) to achieve as independent expression 
5 units, and so the invention provides that instead, the 
E6 and E7 open reading frames may be fused together. 
A problem with standard methods for Insertion of 
foreign information into the vaccinia virus genome is 
that the use of selectable markers to increase the 

10 efficiency of recombination results in the ultimate 
presence in the recombinant virus also of the 
selectable marker gene itself. Methods for insertion 
have been developed however, which allow subsequent 
elimination of these extraneous sequences (Falkner & 

15 Moss J. Virol., 64, 3108, 1990) and these are used in 
an embodiment of the present invention to ensure that 
the final recombinant vaccinia virus has only those 
additional sequences which are necessary for its 
required function. 

20 In order that the present invention is more fully 

understood an embodiment will now be described in more 
detail with reference to the figures in which: 

Figure 1(a) shows the nucleotide sequence and 
three-frame translation of HPV16 E6/E7 polymerase 

25 chain reaction product (underlined regions indicate 
the E6 and E7 coding sequences); Figure 1(b) shows the 
nucleotide sequence and three- frame translation of 
HP VI 8 E6/E7 polymeras chain reaction product 
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( underlined regions indicate the E6 and E7 coding 
sequences ) ; 

Figure 2 shows the cloning and modification of the 
HPV16 and HPV18 E6 and B7 genes ; 
5 Figure 3 shows an open reading frame plot of vaccinia 
virus from positions 17201-18450 of the region covered 
by the four fragments SalF,G,H and I; short vertical 
lines denote termination codons, lines topped with 
boxes denote initiation codons, rectangles show 

10 relevant open reading frames, and arrows show 
direction of upper and lower DNA strands; 
Figure 4 shows an open reading frame plot of vaccinia 
virus from positions 21001-22000 of the region covered 
by the four fragments SalF,G,H and I; short vertical 

1 5 lines denote termination codons , lines topped with 
boxes denote initiation codons , rectangles show 
relevant open reading frames and arrows show direction 
of upper and lower DNA strands; 

Figure 5 shows an open reading frame plot of vaccinia 
20 virus from positions 23501-25000 of the region covered 
by the four fragments SalF,G, H and 1; short vertical 
lines denote termination codons, lines topped with 
boxes denote initiations codons, rectangles show 
relevant open reading frames and arrows show the 
25 direction of upper and lower strands of DNA; 

Figure 6 shows a codon usage plot of vaccinia virus 
from positions 17201-18450 of the region covered by 
the four fragments SalF,G,H and I; arrows show 
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direction of each DNA strand; 

Figure 7 shows a codon usage plot of vaccinia virus 
from positions 21001-22000 of the region covered by 
the four fragments SalF r G,H and I ; arrows show 
5 direction of each DNA strand; 

Figure 8 shows a codon usage plot of vaccinia virus 
from positions 23501-25000 of the region covered by 
the four fragments SalF„G,H and I; arrows show 
direction of each DNA strand; 
10 Figure 9 shows the DNA sequence around site A showing 
translations in single letter amino acid code of genes 
SalF 17R and SalF 19R; 

Figure 10 shows the DNA sequence around site B showing 
translations in single letter amino acid code of genes 
15 SalF 20R and SalF20.5R; 

Figure 11 shows a comparison of the SalG2R open 
reading frame to the yeast guanylate kinase gene 
sequence; 

Figure 12 shows the DNA sequence around site D showing 
20 translations in single letter amino acid code of genes 
HindB3R and Hind B4R; 

Figure 13 shows the cloning of vaccinia virus (Wyeth 
strain) neutral sites; 

Figure 14 shows the cloning of vaccinia virus promoter 
25 sequences; 

Figure 15 shows the construction of vaccinia 
promoter-driven E6-7 cassette; 

Figure 16 shows the cloning of the E6-7 cassette into 
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vaccinia virus (Wyeth strain) n utral sites; 
Figure 17 is a diagram showing the recombination 
required to generate the final therapeutic vaccinia 
virus - HPV recombinant virus; 
5 Figure 18 shows the synthetic oligonucleotides used in 
the construction of the therapeutic vaccinia virus HPV 
recombinant; 

Figure 19 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 17201-18450 of the 
10 region covered by the four fragments SalF,G,H and 1; 
Figure 20 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 21001-22000 of the 
region covered by the four fragments SalF,G,H and I; 
and 

15 Figure 21 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 23501-25000 of the 
region covered by the four fragments SalF,G,H and I. 
Figure 26 shows a variety of options for arrangment of 
HPV16E6 and E7 and HPV18E6 and E7 coding sequences in 

20 a recombinant virus vector. 

All cloning procedures are carried out according 
to the protocols described in "Molecular Cloning", A 
Laboratory Manual, eds. Sambrook, Fritsch and 
Maniatis, Cold Spring Harbor Laboratory Press, 1989* 

25 All plasmids on which site directed mutagenesis is 
performed are of the "phagemid" type, which may be 
converted to singl -stranded DNA by superinfection 
with the bact riophage fl. Preparation and sit 
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directed mutagenesis f single- stranded DNA, is 
carried out as described by Brier ley et al.. Cell, 57, 
537, 1989. The sequence of all the synthetic 
oligonucleotides used are provided in figure 18. 

5 Preparation of the E6 and E7 genes from HPV16 and 
HPV18 for insertion into vaccinia virus 
Cloning of the HPV16 and HPV18 E6 and E7 
A fragment of DNA containing the HPV16 E6/7 coding 
region is prepared by polymerase chain reaction (PCR) 
10 amplification from the plasmid pBR322/HPV16 (Durst et 
al., PNAS, 80 , 3812, 1983) using the oligonucleotides 
SOS and S06. A fragment containing the same region 
from HPV18 is prepared by the same procedure from 
plasmid pBR322/HPV18 (Boshart et al., EMBO J. 3,1151) 
15 using the oligonucleotides SOI and S02- Plasmids 
PBR322/HFV16 and pBR322/HPV18 are both available from 
Behringwerke AG, P.O. Box 1140 D-3550, Marburg, 
Germany (alternatively the necessary sequences can be 
created synthetically from the sequence information 
20 provided by the present application). 

In each case, this produces a DNA fragment of 
about 800 base pairs (bp) with a site for the 
restriction enzyme Nco 1 (CCATGG) located exactly at 
the beginning of the E6 gene, and a Smal site 
25 immediately downstream of the termination codon for 
the E7 gene (figure 1(a) and (b)). The products are 
then digested with Ncol and Smal, and cloned into 
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Ncol-Smal digested plasmid pUC118NS (a modified 
version of th "phagemid" pUC118 (Viera & Messing, 
Methods Enzymol., 153,3, 1987) in which Ncol and Smal 
sites have been created by site-directed mutagenesis 
5 within the poly-linker region) to generate the plasmid 
plMS7 , containing the HPV1 6 sequences , and pIMS8 
containing the HPV18 sequences (figure 2). The use of 
pUC118 is not crucial to the present strategy as any 
plasmid which can be manipulated by site directed 
10 mutagenesis can be successfully used. 

Fusion of the E6 and E7 ORFs 

For insertion into vaccinia virus, the E6 and E7 genes 
from each HFV type, are first fused together to form 
a single continuous ORF. This is achieved by 
15 site-directed mutagenesis as follows: 

(i) The termination codon TAA of HFV16 E6 in pIMS7 is 
altered using the oligonucleotide S20 to the sequence 
GGAA. This is in order to convert the normally 
separate ORFs for HPV16 E6 and E7 into a single ORF 

20 (pIMS7.1 - figure 2). 

(ii) The termination codon TAA of HFV18 E6 in pIMS8 is 
altered using the oligonucleotide S21 to the sequence 
G6AA • This is in order to convert the normally 
separate ORFs for HPV18 E6 and E7 into a single ORF 

25 (pIMS8.1 - figure 2). 

Abolition of the immortalising potential of E7 
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In order to destroy the immortalising properties of 
each of the E7 proteins, two key cod ns within the 
HFV16 E7 coding sequence, (cys24 and glu26 - figure 
1(a)) and the equivalent codons from HPV18 E7 (cys27 
5 and glu29 - figure 1(b)), are altered to glycine 
residues by site directed mutagenesis as follows. 

(i) The sequence of the E7 gene is altered in pIMS7 
to encode glycine at codons 24 and 26 ( normally 
encoding cysteine and glutamate respectively, using 

10 oligonucleotide S22 (pIMS7.2 - figure 2). 

(ii) The sequence of the E7 gene is altered in pIMS8 
to encode glycine at codons 27 and 29 ( normally 
encoding cysteine and glutamate respectively, using 
oligonucleotide S23 (pIMS8.1B - figure 2). 

15 Reduction in intertypic recombination potential of 
HPV16 and HPV18 E6 and E7 sequence and elimi nation of 
potential vaccinia virus transcription termination 
signal 

A potential difficulty with the presence of both HP VI 6 
20 and HP VI 8 E6 and E7 specific DNA within the genome of 
a single virus, is that recombination between the two 
sets of related sequences could lead to loss or 
rearrangement of information such that expression of 
the required proteins is disrupted. The invention 
25 provides ways of minimising this risk. Firstly, by 
inserting the two sets of genetic information in the 
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vaccinia genome in opposite orientation to each other 
(so that recombinati n will result not in the loss of 
sequence information, but rather in its inversion). 
Secondly, by creating specific changes in the E6/7 
5 sequence of one of the HPV virus strains at sites 
where the homology is greatest. These changes however 
are made in such a way that the amino acid coding 
potential of the genes remains unaltered. 

The HP VI 8 E6 sequences is therefore altered by 
10 site-directed mutagenesis as follows: 

The sequence TTTTTATTCTAGAATTAGAG (which begins 
210 nucleotides from the start of E6 - underlined 
in figure 1(b)) is mutated, using oligonucleotide 
S24 to the sequence TTTCTACAGTAGAATCAGAG (pIMS8.2 
15 - figure 2) (changed nucleotides are in bold 

type). 

A second aim of this change is to eliminate from 
the HPV18 £6 sequence, the sequence TTTTTAT, which is 
a potential termination signal for the early vaccinia 
20 virus transcription enzyme (Rohrmann et al. , Cell. , , 46, 
1029, 1986). 

Source and propagation of vaccinia virus 
The Wyeth strain of vaccinia virus is used for 
construction of the therapeutic virus. It is propagated 
25 in Vero cells for the purposes of genetic manipulation, 
and in the human diploid fibroblast cell line MRC5 for 
th production of th final therapeutic virus st ck. 
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Both cell lines are obtained from the National Institute 
of Biological Standards and Control, South Mims, U.K. . 
The Wyeth strain of vaccinia virus, vero cells and the 
cell line MRC5 are also available from the American Type 
5 Culture Collection, 12301 Parklawn Drive, Rockville, 
Maryland 20852, USA. 

Identification and cloning of neutral sites from the 

vaccinia virus 

Description of neutral sites 
10 For the purpose of insertion of papillomavirus genes 

within the vaccinia virus genome, sites have been chosen 

to have two characteristics. 

Firstly, they should be non-essential regions, i.e. 

insertion of foreign genes at these points, will not 
15 disrupt any functions of the vaccinia virus to the 

extent that the virus can no longer grow in tissue 

culture. 

Secondly, they should be neutral sites , i.e. 
insertion of foreign genes all these points, will not 

20 increase or decrease the level of attenuation of the 
vaccinia virus. 

The difference between these two factors can be 
seen by looking at the thymidine kinase (TK) gene of 
vaccinia virus. It is a non-essential region, and hence 

25 viruses with genes inserted in the TK gene can grow well 
in tissue culture (Mackett at al. r J. Virol., 49, 857, 
1984). However, such viruses have been found to be 
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greatly attenuated in vivo (Buller et al., Nature 317, 
813, 1985)* For the purpose of prophylactic 

vaccination , such Increased attenuation might be 
desirable. However, for an immunotherapeutlc strategy 
5 where the danger from the disease to be treated clearly 
outweighs the risk of vaccine associated complications, 
use of an attenuated virus is considered undesirable 
since it could compromise the immunological response to 
the papillomavirus antigens. Hence, the applicants have 
10 identified sites which they judge will not attenuate the 
virus any further, and have termed them 'neutral sites * . 
Such sites have been identified within the virus genome 
by careful analysis of the DNA sequence of the WR 
strain. The WR strain was originally derived from the 
15 Wyeth strain by passage in mouse brain. Therefore the 
two strains are closely related. The nucleotide 
sequence of three regions of the WR genome which contain 
the selected neutral sites are shown in figures 19 , 20 
and 21. Four neutral sites (A-D) have been chosen on 
20 the above discussed criteria as follows : 
Site A: gap between SalF17R and SalF19R 
Site B: gap between SalF19R and SalF20.5R 
Site C: in Sal62R, a potential non-functional gene 
Site D: in HindB3.5R, a potential non- functional gene 
25 These sites (A-D) can be identified by the following 
stretches of DNA sequence , each of which is 40 
nucleotides in length. 

A CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT 
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B TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA 
C TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA 
D TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA 
In other strains these sequences may vary, while still 
5 having substantial homology with those given above. In 
particular a site may have at least 90%, more preferably 
95%, homology with the sequence given above. 
Figures 3-5 show the distribution of initiation codons 
and open reading frames (ORFs) in the regions of the 
10 vaccinia virus genome shown in figures 19,20 and 21. 
Figures 6-8 show the same regions with a plot showing 
to what extent each reading frame conforms to the 
pattern of codon usage expected for vaccinia genes. A 
graph of codon usage is plotted for each of the three 
15 possible reading frames in each direction (Staden, R. , 
Nucl. Acids Res., 12, 521, 1984; Staden, R. , Nucl. Acids 
Res., 12, 551, 1984). In these codon usuage plots, the 
short vertical bar lines extending from the horizontal 
axes represent start codons. The longer vertical bar 
20 lines placed above the horizontal axes represent stop 
codons. This sort of plot is a useful way of helping to 
determine whether a particular ORF is a genuine vaccinia 
gene. Where there is a likely genuine gene, the graph 
of codon usuage will rise between a start codon and a 
25 stop codon. For example, in figure 7, it can be seen 
that the graph of codon usage rises over the region of 
the SalG2R ORF (the dott d lin shows that this frame 
conforms most of the expected codon usage). For the 
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other two frames th graphs show that they do not 
conf rm t vaccinia codon usage. The peak labelled 
'part of gk', and marked with a dashed line, also 
conforms well to vaccinia codon usage* In summary, a 
5 genuine gene must start with an initiation (start) 
codon, end with a termination (stop) codon, and should 
conform well to vaccinia codon usage along its length. 
In most cases the conformation to the vaccinia codon 
usage drops off sharply outside the gene. 

10 The neutral sites are further described as follows 
Site A. Gap between SalF17R and SalF19R 
Site A is marked on figures 3 and 6. Figure 9 shows the 
actual DNA sequence with a translation of the ORFs on 
either side of the site. It can be seen that Site A is 

15 placed in an intergenic region between SalF17R and 
SalF19R. It Is placed some 195 bases upstream of 
SalF19R to avoid any promoter elements associated with 
that gene. The sequence TTTTTCT (shown in Italics) will 
act as a terminator of early RNA transcription for the 

20 SalF17R gene if it is an early gene. However, the site 
is placed downstream of the first of these, so it will 
not affect early termination of transcription if it 
occurs. Examination of figure 6 shows that there is no 
recognisable gene on the opposite strand at this point, 

25 and hence this sequence location is suitable as a 
neutral insertion site. 

Site B. Gap between SalF19R and SalF20,5R 

Site B is marked on figures 3 and 6. Flgur 10 shows 
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the actual DNA sequence with a translation of the ORFs 
on either side of the site. It can be seen that Site 
B is placed in an intergenic region between SalF19R and 
SalF20.5R. Figure 6 shows that it is within a region 
5 of high vaccinia codon usage , but that this region does 
not form a genuine gene, having no initiation codon. 
In addition, figure 6 suggests that Sal20.5R is not a 
complete gene, as the conformation to vaccinia codon 
usage drops off dramatically at the start of the gene. 

10 In the event that SalF20.5R is a genuine gene. Site B 
is placed some 70 bases upstream of SalF20.5R which may 
well avoid any promoter elements associated with that 
gene. (Note: many vaccinia promoter elements are 
located in approximately 35 bases upstream of the start 

15 of the gene. ) In addition SalF20R has no TTTTTNT (N-any 
nucleotide) transcription termination signal with which 
Site B could interfere. Hence this sequence location 
is suitable as a neutral insertion site. 



Site C. Within SalG2R, a potential non- functional gene 
20 Site C is marked on figure 4. The ORF SalG2R has 
considerable similarity to the guanylate kinase (GK) 
gene of yeast. This similarity is shown in figure 11. 
Sequence upstream of the SalG2R ORF (but in a different 
frame) has been added on to SalG2R, to see if the match 
25 to GK extends beyond the boundaries of the original open 
reading frame. Th match appears to extend beyond the 
5 1 end of the SalG2R ORF. In particular, an important 
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site in the yeast GK gene, th ATP/GTP binding site 
(shown underlined) only matches in the out of frame 
sequence upstream of the SalG2R ORF. Hence, it is very 
likely that the SalG2R gene is not active as a guanylate 
5 kinase and can be referred to as a 'pseudogene' . If the 
gene is inactive as the applicants deduce, then it will 
serve as a neutral insertion site. 

Site D. Within HlndB3.5R, a potential non- functional 
gene 

1 0 Figure 5 shows that site D lies within the region 
designated HindB3.5R. This region, although conforming 
to vaccinia codon usage, has no start codon and is 
therefore not a genuine gene. The codon usage plot 
shown in figure 8 indicates that it probably was once 

15 a functional gene, and may well have been attached to 
HindB3R (a shift in the codon usage preference occurs 
here well away from the termination codon of the HindB3R 
ORF which suggests that the last section of HlndB3R is 
not properly part of this gene. ) Hence it is likely 

20 that HindB3«5R is not active as a gene and can be used 
as a neutral insertion site. Figure 12 shows the actual 
DNA sequence with a translation of the ORFs on either 
side of the site. It can be seen that site D is placed 
in an intergenic region between HindB3R and HindB4R as 

25 well as being within the non- functional HindB3.5R. 

Preparation of v ctor for cloning of neutral sites 

In order to insert foreign genetic information into the 
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neutral sites described above, DNA copies of the neutral 
sites, together with an appropriate amount of flanking 
DNA from the vaccinia genome (approximately 500 bases 
on either side) must first be cloned into a plasmid 
5 vector. These plasmids may then be used to introduce the 
foreign DNA into the vaccinia virus genome; the vaccinia 
virus 'flanking sequences' around the inserted gene 
serve to allow homologous recombination between the 
plasmid DNA and the viral DNA, with the consequent 
10 insertion of the foreign gene at the desired location. 

Cloning of neutral site sequences 

Plasmids containing flanking regions from the neutral 
sites are constructed as follows. DNA is prepared from 
the Wyeth strain of vaccinia virus by the method of 

15 Esposito et al., (J. Virol. Meth. 2: 175, 1981). The 
polymerase chain reaction (PCR) is used to remove an 
approximately 1000 base pair (bp) fragment from DNA of 
the Wyeth strain of vaccinia virus. Pairs of 
oligonucleotides are chosen approximately 500 bp either 

20 side of the chosen neutral site. These oligonucleotides 
are based on the sequence of the WR strain, but are 
chosen in regions where the sequence of the WR strain 
is identical to that of the Copenhagen strain (Goebel 
et al.. Virology 179:247, 1990). The oligonucleotides 

25 incorporate restriction enzyme recognition sequences so 
that they can be clon d easily into a plasmid. For 
neutral sites A,B and D the restriction sites are EcoRl 
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and HindllX. For neutral sit C the Hindi 1 1 site is 
replaced by an Sphl site, since there is an internal 
Hindlll site in the chosen flanking sequences. 
The oligonucleotides used for PCR are listed below: 
5 Site A leftMB 16 

Site A rightMB 17 

Site B leftMB 24 

Site B rightMB 25 

Site C leftMB 18 
10 Site C rightMB 19 

Site D leftMB 22 

Site D rightMB 23 

DNA fragments of approximately lkb are then prepared 
using these pairs of oligonucleotides by PCR 
15 amplification, digested with EcoRI and Hindlll (for site 
A,B and 0) or with EcoRI and Sphl (for site C) and 
cloned into Hindlll and EcoRI -digested pUC118 (figure 
13) to generate the plasmids pIMMC7a, pIMMC7b, pIMMC7c 
and pIMMC7d. 

20 Creation of unique restriction sites for insertion at 
the neutral sites 

A suitable restriction enzyme site is then introduced, 
at the selected location within each of the plasmids. 
This is achieved using site directed mutagenesis using 
25 an oligonucleotide containing the desired new unique 
sit and flank d by 15 bases of sequ nee to either side 
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(see below). The plasmids modified in this fashion are 
designated pIMMC8a-d (figure 13). 

original oligonucleotidesite new plasmid 
plasmld Introduced 

5 

pIMMC7a MB35SnaBl pIMMC8a 

pIMMC7b MB36Hpal pIMMCb 

pIMMC7c MB37Stul pIMMC8c 

pIMMC7d MB38SnaBl pIMMC8d 

10 Cloning of the vaccinia virus early/late promoter 
sequences 

The p7.5 and H6 promoters from vaccinia virus genomic 
DNA are prepared by PCR amplification as described 
below. 

15 A pair of complementary oligonucleotides (S7 and S8) is 
synthesised to include the following restriction enzyme 
sites, Hindlll, SnaBl, Hpal, Hindlll, Sail, Ncol, Smal, 
SnaBl and EcoRl, such that the pair, after annealing, 
present at one end Hindu I compatible overhanging ends, 

20 and at the other, EcoRl compatible overhanging ends. 
The two oligonucleotides are allowed to anneal and are 
inserted into pUC118 cut with EcoRl and Hindlll (figure 
14). The resulting vector is called pIMMC3. 

A DNA molecule of approximately 180 bp containing 

25 the H6 promoter is removed from the WR strain of 
vaccinia virus by PCR amplification using the 
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olig nucle tides MB15 (anneals upstream and includes a 
5 '-Sail site) and MB7 (anneals downstr am and includes 
a 5 '-Hindlll site). This is cloned into plMMC3 cleaved 
with Hindlll and Sail to create pIMMC4a (figure 14). 
5 A DNA molecule of approximately 200bp containing the 
p7.5 promoter is then removed from the WR strain of 
vaccinia virus by PCR amplification using the 
oligonucleotides MB32 (anneals upstream and includes a 
5 v -Sail site) and MB33 (anneals downstream and includes 
10 a 5* Ncol site). This is cloned into pIMMC3 cleaved with 
Ncol and Sail to create pIMMC14b. 
Construction of the therapeutic virus 

The strategy required to generate a recombinant vaccinia 
virus containing and expressing the E6-E7 proteins from 
15 HPV16 and HPV18, based on the elements described above 
involves five main stages as outlined below, 
i) Cloning of the modified E6-7 genes downstream of 

vaccinia early promoter sequences 

A DNA fragment containing the modified HPV16 E6-7 
20 sequence is excised from pIMS7.2 by digestion with 

Hindlll and Smal, and cloned into Hindlll and 

Hpal-digested plMMC4a to generate pIMS12 (figure 

15). 

A DNA fragment containing the modified HPV18 E6-7 
25 sequence is excised from pIMS8.2 by digestion with 

Ncol and Smal , and cloned into Ncol and 
Smal-dig sted pIMMC14b to generate plMS14 (figure 
15). 
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ii) Preparation of a plasmid vector containing both 
HPV16 and HP VI 8 E6-7 sequences together with th ir 
upstream vaccinia promoters , 

A DNA fragment containing the HPV18 E6-7 region 
5 together with the upstream p7.5 promoter is 

excised from pIMS!4 with Sail and Smal and 
inserted into Sail and Smal-digested pIMS12 to 
generate pIMSIS (figure 15) 

iii) insertion of the HPV E6-7/promoter "double" 
10 cartridge into the neutral site containing 

plasmids . 

A DNA fragment containing both the HPV16 and HPV18 
E6-7 coding regions together with their upstream 
promoter elements is excised from pIMSIS with 
15 SnaBl and inserted into the appropriately-digested 

neutral site-containing plasmids pIMMC7a-d. This 
step is shown in figure 16 , and the resulting 
plasmids are designated pIMMC9a-d. 

iv) Introduction via homologous recombination of the 
20 neutral site DNA, together with the intervening HPV 

sequences, into the vaccinia virus genome to create 
a recombinant virus expressing the two modified HPV 
E6-7 sequences . 

The recombinant plasmids pIMMC9a-d are purified 
25 and allowed to recombine into vaccinia (figure 17) 

using standard protocols (Mackett et al., in D.M. 
Glover (ed) DNA Cloning: a Practical Approach, 
Oxford and Washington DC, IRL Press, 1985). 
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Viruses which have acquired the HPV sequences are 
identified by pr bing with radiolabeled HPV 
specific sequences. Viral plaques are lifted onto 
nitrocellulose (Villareal and Berg, Science 196, 
183, 1977) and probed with radiolabelled Ncol-Smal 
fragment from pIMS14 containing the HPV 18 B67 
gene. Recombinant viruses are then isolated from 
the agarose overlay and plaque purified three 
times. They are checked for the presence of the 
appropriate DNA sequences by Southern blotting of 
purified virus DNA using DNA probes derived from 
the HPV E6 and E7 genes, and for expression of the 
appropriate sequences by western blotting using 
antisera specific for the HPV £6 and E7 proteins. 

Cloning of the therapeutic virus in MRC5 cells 
Stocks of the final recombinant virus are prepared 
by growth in Vero cells, and are used to infect 
MRC5 cells deemed suitable for the preparation of 
material suitable for use as human vaccines. The 
virus is plaque-purified three times by standard 
methods, and finally a stock prepared for clinical 
use. 

Confirmation of presence of the correct HPV DNA 
insert 

A sample of this stock virus is checked once again 
for the presence of correctly configured virus 
DNA , and for expression of the correct virus 
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proteins- Figure 22 sh ws the analysis by PCR, f 
a recombinant vaccinia virus (v9a.l) in which the 
HPV DNA cassette is inserted at Site A. The 
diagram shown in panel (a) indicates the DNA 
fragments expected if insertion of the correct DNA 
has occurred. It can be seen in panel (b) that 
the actual pattern of PCR products generated is 
consistent with that expected. 

Confirmation of expression of the HPV DNA insert 
The recombinant viruses are then checked for 
expression of the expected HPV proteins . An 
example of this analysis is shown in Figure 23- 
Vero cells are infected with with recombinant 
virus v9a.l (HPV DNA inserted at Site A), and the 
cells examined by western blotting for the 
presence of the HPV E67 fusions proteins using 
monoclonal antibodies specific for the HPV16 E7 
protein (camvir3) and for the HPV18 E7 protein 
( 7E10 ) . It can be seen that both monoclonal 
antibodies recognise specifically proteins of the 
expected size in cells infected with the 
recombinant virus v9a.l, but not in cells infected 
with the control parent virus Wyeth strain. These 
recognised proteins co-migrate with proteins 
synthesised by in vitro translation of mRNA 
encoding the expected HPV fusion proteins (HPV 16 
E67 and HPV18 E67). This experiment indicates 
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successful expression of the heterologous gene 
sequences from the recombinant virus. 
Stability of the HPV DNA Insert 

For the recombinant virus to be of use clinically, 
It Is Important that the Inserted sequences remain 
genetically stable over multiple virus passage, 
and the DNA insert was carefully designed to 
promote this genetic instability. To confirm the 
stability of the HPV information within the 
recombinant virus genome, the virus is subjected 
to 9 serial passages, (multiplicity of infection 
«10pfu/cell) in Vero cells. Subsequently 20 
plaque isolates are picked, and analysed for the 
presence of the correct HPV DNA insert by PCR 
analysis as described in Figure 22. The data 
obtained for recombinant virus A are shown in 
Figure 24. All 20 virus isolates retained the HPV 
information in the expected genetic arrangement 
indicating a considerable degree of genetic 
stability. 

Animal experiments 

The virulence of the recombinant virus is compared 
in animal experiments with that of the parental 
Wyeth strain. Groups of 20 mice are inoculated 
intranasally each with 10 7 pf u of Wyeth strain or 
recombinant virus in a total volume of 20|il. Two 
mice are sacr 1 f iced at 1 day , 3 days and 5days 
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following inoculation, and th lungs dissected 
out. The amount of virus present in the lungs is 
then measured by grinding the tissue, and assay of 
the homogenate by standard vaccinia virus plaque 
5 assay. The results of such an experiment for the 

recombinant virus v9a.l (HPV infromation inserted 
at site A) are shown in Figure 25. It can be seen 
that the recombinant virus retains the ability to 
replicate in mice, and that the level of virus 
10 produced in the lungs of the infected animals is 

similar to that seen with the parental Wyeth 
strain. 

Therapeutic Use 

A stock of the recombinant virus is prepared by 
15 infection of MRC5 cells, and adjusted to a 

concentration of not less than 10 s pfu/xnl. 20 ]il 
of this material is applied to the arm of the 
patient, which is then scarified through the virus 
droplet with a bifurcated needle, according to the 
20 standard procedure used for vaccination against 

smallpox. 
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CLAIMS 

1. A recombinant virus vector for use as an 
immuno therapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
5 virus which have sufficient sequence homology that 
recombination between them might be expected; 

wherein said pair of nucleotide sequences are 
arranged in said virus vector such that they are 
Inverted with respect to each other to reduce the 
10 likelihood of recombination events leading to loss of 
part or all of said sequences and said virus vector is 
able to Infect a mammalian host cell and express as 
polypeptide the heterologous nucleotide sequences in 
said host cell. 

15 2. A recombinant virus vector according to claim 1 
wherein at least one pair of nucleotide sequences 
encode part or all of human papillomavirus (HPV) wild- 
type proteins or mutant proteins immunologically 
cross-reactive with said wild-type proteins. 

20 3. A recombinant virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild-type proteins 
HPV16E7 and HPV18E7 or mutant proteins immunologically 
cross-reactive ther with. 
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4. A recombinant virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild-type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 

5 cross-reactive therewith. 

5. A recombinant virus vector according to claim 3 
which comprises a further pair of nucleotide sequences 
which encode part or all of the HPV wild-type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 

10 cross-reactive therewith. 

6. A recombinant virus vector according to any one 
of the preceding claims wherein two or more nucleotide 
sequences of different said pairs may be fused 
together to form a single open reading frame. 

15 7. A recombinant virus vector according to claim 6 
wherein the fusions are via a single codon encoding a 
small neutral amino acid. 

8. A recombinant virus vector according to claim 7 
wherein the amino acid is glycine. 

20 9. A recombinant virus vector according to claim 5 
in which the pairs of nucleotide sequences are 
arranged in the virus vector according to any one of 
the options as shown in Figure 26. 
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10. A recombinant virus vector according to claim 9 
which comprises 

a first open reading frame having a fused genetic 
sequence encoding part or all of the wild-type 
5 proteins E6 and E7 from HFV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild- type proteins E6 and E7 from HPV18; > 

wherein the first and second open reading frames 
10 may be Inverted with respect to one another whereby 
either: 

i) the E6 coding sequences of HPV16 and HPV18 
are both located between the E7 coding sequences of 
HPV16 and HFV18; or 
15 ii) the E7 coding sequences of HPV16 and HPV18 

are both located between the E6 coding sequences of 
HPV16 and HFV18; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross - 
20 reactive therewith. 

11. A recombinant virus vector according to claim 10 
wherein each of the first and second open reading 
frames has a corresponding promoter and the two open 
reading frames each with its promoter , are arranged 

25 next to each other in the virus. 

12. A recombinant virus vector according to claim 11 
wherein either: 
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±) the promoters are located between th first 
and second open reading frames whereby the open 
reading frames are transcribed outwardly; or 

11) the promoters are located outside the first 
5 and second open reading frames whereby the open 
reading frames are transcribed inwardly. 



13. A recombinant virus vector according to any one 
of claims 9 to 12 which comprises 

a first open reading frame having a fused genetic 
10 sequence encoding part or all of the wild-type 
proteins E6 and E7 from HPV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HP VI 8; 
15 wherein the E6 coding sequences of HP VI 6 and 

HPV18 are both located between the E7 coding sequences 
of HPV16 and HPV18; and 

each open reading frame has a corresponding 
promoter, the promoters being located between the 
20 first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross- 
reactive therewith. 



25 



14. A recombinant virus vector according to any one 
of the preceding claims wherein either or both of the 
nucleotide sequences in a said pair of nucl otid 



WO 92/16636 PCT/GB92/00424 

47 

sequences are altered to make them less homologous 
than an equivalent pair £ nucleotide sequences 
encoding wild- type HPV proteins. 

15. A recombinant virus vector according to claim 14 
5 wherein the alteration in nucleotide sequence does not 

result in an alteration of the encoded amino acid 
sequence. 

16. A recombinant virus vector according to claim 3 
wherein the wild-type proteins HFV16E7 and HPV18E7 are 

10 replaced with mutant proteins which are substantially 
homologous to said wild-type proteins and in which the 
residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine 

15 residues. 

17. A recombinant virus vector according to any one 
of the preceding claims wherein said heterologous 
nucleotide sequences may comprise part or all of the 
sequences shown in Figures 1(a) and 1(b). 

20 18. A recombinant virus vector according to any one 
of the preceding claims which is derivable from 
vaccinia virus. 

19. A recombinant virus vector according to any one 
of th preceding claims wherein the nucleotide 
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sequences are inserted into the virus vector at one or 
more neutral sites, the disruption of which by the 
insertion of the nucleotide sequences does not 
substantially adversely affect viral functions 
5 relating to the replicative ability of the virus in 
the mammalian cell. 



20. A recombinant virus vector according to claim 19 
which is derivable from vaccinia virus and wherein the 
neutral sites may be one or more of: 
10 A) the gap between SalIF17R and SalIF19R of 

strain WR comprising at least part of the sequence 
CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT ; 

B) the gap between SalIF19R and SalIF20.5R of 
strain WR comprising at least part of the sequence 

15 TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA ; 

C) a region in SalIG2R of strain WR comprising 
at least part of the sequence 

TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA ; 

D) a region in HindB3.5R of strain WR comprising 

20 at least part of the sequence 

TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA ; 

E) a sequence having at least 90% sequence 
homology to those sequences A) to D) identified above. 



21. A method for making a recombinant virus vector 
25 according to claim 19 or claim 20 which comprises 
ins rting a said heterologous nucleotid sequences into 
one or more neutral sites in a virus vector, th 



WO 92/16636 



PCT/GB92/00424 



49 

disruption of such a sit by said insertion will not 
substantially adversely affect the replicative ability 
of the virus and wherein the neutral site has been 
previously identified by: (a) analysing a viral genome 
5 to identify open reading frames which are likely to 
encode functional genes; and (b) selecting sites between 
open reading frames for functional genes or sites within 
sequences for non- functional genes. 

22. A recombinant virus vector obtainable by the method 
10 of claim 21. 

23. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to manufacture a medicament for use as an 
immunotherapeutic or vaccine against a condition thought 

15 to be caused by HPV infection , for example cervical 
cancer. 

24. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to specifically activate cells of the immune 

20 system to HPV proteins. 
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Fig.la. 

ATCCCATGGACCAAAAGAGAACTGCAATGTTTCAGGACCCACAG^ 
IPWTKRELQCFRTHRSDPES 
SHGPKENCNVSGPTGATQKV 
P MDOKRTAMFOPPOE RPRKT. 

A Start of E6 coding region 



TACCACAGTTATGCACAGAGCTGCAAACAACTAra^ 120 
YHS YAQS CKQLYMI * Y * N V C 
TTVMHRAANNYT* YNI RMCV 
POLtTTELOTTTHn iTT. pry Y 

ACTGCAAGCAACAGTTACTGCGACGTGAGGTATATGACTTTGC^ 180 
TASNSYCDVRYM TLLFGIYA 
LQATVTAT*GI *LCFSGFMH 
CKOOI.T, RR'YYnF AFRPLrT 

TAGTAXATAGAGATGGGAATCCATATGCTGTATGTGATAAAT 240 
* Y I EMGI HMLYVI N V * S FI L 
SI *RWESICCM**MFKVLF* 
VYRnffWPYAVC-PTcrT.yyyc^ 

AAATTAGTGAGTATAGACATTATTGTTATAGTTTG^ 300 
KLVSIDIIVIVCMEQH*NSN 
N**V*TLLL*FVWNNIRTAI 

ISEYRHYCY SLYfiTTLEOOY 

ACAACAAACCGTTGTGTGATTTGTTAATTAGGTGTATXAACTGTCAAAAGCC^ 360 
TTNRCVIC*LGVLTVKS HCV 
QQTVV*FVN*VY*LSKATVS 

■NKPLCPLLIRgTHC OTCPT r p 

CTGAAGAAAAGCAAAGACATCTGGACAAAAAGCAAAGATTCCATAATATAAGGGGTCG^ 420 
LKKSKDIWTKSKDSII-GVG 
* RKAKTSGQKAKI? * YKGSV 
EEKORHLDKFOP.F HMTR^RW 

GGACCGGTCGATGTATGTCTTGTTGCAGATCATCAAGAACACGTAGAGAAACCCAGCTGT - 480 

GPVDVCLVADHQEHVEKPSC 
CRSMYVLLOI IKNT*RNPAV 
TGRfTMCCCRgg RTRRET^I. * 
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Fig.la(Cont). 

AATCATGCATGGAGATACACCTACA^ 540 
NHAWRYTYIA* IYVRFATRD 
I MH^DTPTT. HEYMT, DT. QPET 
SCMEIHLHCMNIC* ICNQRQ 
~ Start of E7 coding region 

AACTGATCTCTACTGTTATGAGCAATTAAATGACAGCTCAGAG 60C 
N*SLLL*AIK*QLRGGG*NR 
TDT.YCYEQLIinSfSEF EDPTn 
LI STVMSN*MTAQRRRM K* M 

TGGTCCAGCTGGACAAGCAGAACCGGAC^GAGGCCATTACAATATTCT 660 

WSSWTSRTGQSPLQYCNLLL 
GPAG OAEPDRA HYNTVTr rr 

VQLDKQNRTEPITI L * PFVA 



CAAGTGTGACTCTACGCTTCGGTTGTGCCT 
QV*LYASVVRTKHTRRHSYF 

KCnsTT,TiLrvosrw vnTPT T l 

SVTLRFGCAYKAHT* TFVLW 

GGAAGACCTGTTAATGGGCACACTAGGAATTGT^ 

GRPVNGHTRNCVPHLFSETI 
— EDT, T. MfiTLGT yroTCSnftP * 

KTC*WAH* ELCAPSVLRNHN 

ACCCGGGTGA 
T R V 
P G * 
P G 



720 



780 



840 
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Fig.1t. 

ATCCCATGGCGCGCTTTGAGGATCCAACACGGCGACCCTACAAGC^^ 60 
IPWRALRIQHGDPTSYLICA 
SHGAL*GSNTATLQAT* SVH 
P MARFEDPTRRPYKLPDT, TT 

A Start of E6 coding region 

CGGAACTGAACACTTCACTGCAAGACATAGAAATAACCTGT 120 
RN * TLHCK"T*K* PVY IARQY 
GTEHFTARHRNNLCILQDS I 
ELNTSLODIEITCVYCKTVT. 

TGGAACTTACAGAGGTATTTGAATTTGCATTTAAAGATTTATTTC 1 80 
WNLQRYLNLHLKIYLWCIET 
GTYRGI*ICI*RFICGV*RQ 
ELTEVFEFAFKDLFV VYRnS 

GTATACCCCATGCTGCATGCCATAAATGTATAGATTTTT 240 
VYPMLHAINV*IFILELEN* 
YTPCCMP*MYRFLF*N*RIK 
IFHAACffKCTDFYS'R TRET. R 

GAGATTATTCAGACTCTGTGTATGGAGACACATTGGAAAAA 300 
DI IQT LCMETHWKN*LTLGY 
TLFRLCVWRHIGKTN* HWVI 
HYSDSVYGDTLEKLTNTGLY 

ACAATTTATTAATAAGGTGCCTCCGGTGCCAGAAACCGTTGAATC 360 
T I Y * *G ACGARNR*IQQKNL 
QF I NKVPAVP E TVE S S R K T * 
NT. L TRET. RCOKPLNP AEKT. R 

C^CACCTTAATGAAAAACGACGATTTCACA^ 420 
DTLMKND DFTT* LGTI EASA 
TP * *KTTISQHSWAL* RPVP 
HLMFKRRFHWT AgHYRfinrH 

ATTCGTGCTGCAACCGAGCACGACAGGAACGACTCCAACGACGCAGAGAAACACAA 480 
IRAATEHDRNDSNDAEKHKY 
FVLQPSTTGTTPTTQRNTSI 
SCCNRAROFP TORRRFTOV* 
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Fig.lb(Cont). 



AATATTAAGTATGCATGGACCTAAGGCAACATTGCAAGACATTGTATTGCATTTAGAGCC 540 
NI KYAWT*GNIARHC IAFRA 
I L S MHGPKATLODIVLHT.FP 
Y*VCMDLRQHCKTLYCI*SP 

* Start of E7 coding region 

CCAAAATGAAATTCCGGTTGACCTTCTATGTCACGAGCAATO 600 
PK*NSG*PSMSRAIKRLRGR 
OHEIPVPLLCHEnT. SDSFFP 
KMKFRLTFYVTSN-ATQRKK 

AAACGATGAAATAGATGGAGTTAATCATCAACATTTACCAGCCCGACGAGCCGAACCACA 660 
KR*NRWS*SSTFTSPTSRTT 
N D F I D GVNHQHLPARRAFPO 
TMK*MELI INIYQP D E P N H N 



ACGTCACACAATGTTGTGTATGTGTTGTAAGTGT^ 720 

TSHNVVYVL*V*SQN*ASSR 

R H TMLCMCCKCF A P T F L V V F 

VTQCCVCVVSVKPELS * * * K 

AAGCTCAGCAGACGACCTTCGAGCATTCCAGCAG C ^ 780 
KLSRRPSSIPAAVSEHPVLC 
SSADDLRAFOOLFT, NTLSFV 

AQQTTFEHSSSCF*TPCPLC 

GTGTCCGTGGTGTGCATCCCAGCAGTAACCCGGGTGA 840 
VSVVCIPAVTRV 
CPWCASOO* P G * 
VRGVHPSSNPG 
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Fig. 3. 
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Fig A. 
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Fig. 5. 
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Fig. 7. 
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Fig. 9. 



SolFI7R 

GRTTGTGTGCRTGCGTGGTAGATGTTTGGRGRAflTGAGflflACTGTTTTCTRGRTGGAflflT 501 
LCflCUUDUURNEKLFSfiUKV 

flTTGTTTflCGflCCTRTTflfiflCTCTTTflTTRflTGRTCflCflTGCTTGflTflflCflTflflflflTCTfl 561 
CLRRIKLFIHOHtlLOKIKSI 

TRCTGCflGflflTflGflCTflGTflTflTGTGGflflflTGTCRTflGflflRGTTRflflflGTTflflTGflGflGC 621 
LQHRLUVUEI1S* 

flflflflRTRTflTflflGGTTGTfiTTCCflTflTTTGTTRTrrrrrcrGTflflTflGTTflGflflflflflTflC 661 

Site fl 

RTTCGflTGGTCTflTCTflCCflGflTTflTIfllim.TTflTflflGGTflCrrrrrcrCflTnflTflRflC 741 



TflGHGTflTGRGTflHGflTflGTGTTTTTCflflflACflTflTflflflTCTflflAflTTGflTGGflTGAGflT 



TGGRTRTflflflGflflTTTGCTGflCTGCflTGTflCTflTTTTTTflCflTTflCTflCflTTGGCTRCGG 
GV KEFfiDCnVVFLHVVlGVG 



801 



flTRCflGCTflTTflflTTTCGRflflflTRTRTTTTRflTCTGflTflflCTTTflflflCflTGGflTTTTTGfl 861 

SalF19R 

TGGTGGTTTflflCGTTTTflflflflflflflGRTTTTGTTRTTGTflGTflTflTGflTflflTflTTflflflflGfl 921 

0 I I L K 0 



981 
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Fig. 10. 

SalF20R 

TGCCflRGGTTRGATGTRflTGGTARCGRTflRCRCflRflflTGTGflRCGCTGCCCflCCTCRTRC 1 1 40 

RKURCNGNOHTICCE RCPPHT 

flTflTflCCRCflflTCCCflflTTflTTCTflflTGGflTGTCfl.TCflflTGTRGflflflRTGCCCflflCCGGfl 1200 
VTTIPI ILflOUIHUENRQPO 

TCRTTTGATflRGGTRRRGTGTRCCGGRRCRCflGRRCRGTflRATGTTCGTGTCTTCCTGGT 1260 
H L I R • 

Site 6 

TGGTRTTGTGCTflCTGflTTCmatnGflCTGflflGflTTGTTGflRflTTGTGTflCCflflRRflGG 1 320 

SolF20.5R 

flGflTGTCCflTGCGGflTflCTTTGGTGGflflTRGflTGflBCflflGGflflflTCCTflTTTGTflflflTCG 1 360 

flNKEILFUHR 

TGTTGTGTTGGtGflflTRTTGCGflCTflCCTflCGTflflTTflTflGflCTTGflTCCflTTTCCTCCR mo 
UULUHIRTTVUI IDLIHFLH 

TGCRflflCTftTCTRflflTGTflflTTflflTTRTGflTTTTGflTGRTflflTGTTflCCflTflCflTTflTflT 1500 
RHVLHUIHVOFDONUTIHVI 

CGCTflCTTGGTTflGTGTRTTflTTCflGTflTGflflGRCCTflTTflfiTflflTTflCTTflTCTTTTGfl 1 560 

RTULUVVS U* 

CGflTCTTGTTflTflflTTflTflflTflTflRflRflTflCTTflTGGCflTRGTflflCTCflTflflTTGCTGflC 1 620 
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Fig. 11. 



Gap Uetght. 3.000 Average Hatch* 0.510 

Length Ueight. 0.100 fluerage tlisnatch. -0.396 

Quality. 113.0 Length. 199 

Ratio. 0.608 Gaps. 4 

Percent Sisilority. 56. 354 Percent Identity-. 29.282 

The top sequence is SalG2R and the bottom sequence is the yeast GK 

Start of G2R 

v 

1 nSGIUKSI ILS GPSCLGKTfi lfllCRLnG I V . . . L0LUCP I PLDFLULflERE 47 



1 SRP I U IS GPSGTGKST LLKgLFREVPDSFGFSUSSTTBTPRflGEUN 16 

46 GUDVH VUNREfl I UKG I RAGNFLEHTEFLGN I VGTSKTAUNTRfl I NNR I CU 97 
•**. • *• ••««• * , 

47 GKOVNFUSUDEFKSniKNNEFIEURQFSGNYYGSTURSUKQUSKSGKTCI 96 

98 MDLN 1 0GURSLKH . TVLnPVSUV I RPTSLKHUETKLRCRNTERNDE I HRR 1 46 
97 LD I OtlQGUKSUKR I PELNRRFLF I RPPSUEOLKKRLEGRGTETEES I NKR M6 

147 U I LAKTDtlDERNERGLFDT 1 1 1 EO0UNLRYSKL 1 0 . I LQOR I RflVFNTN 191 
147 LSRRQRELRVR . ETGHHDKU I UNDOLDKflVKELKDF I FflEK 186 
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Fig. 12. 



HlndB3R 

CTAAGAAGVCGTATACCGCACCAGCTTCCTTTATACTCTGVTCTTTTACCAACACAAAGG 507 
LRTRIRQQLPLYSHLLPTQR 

GTGGATATTTGTTCATTG6AGTTGATAATAATACACACAAAGTAATTGGATTCACGGTGG 567 
VDICSLELIIIHTK* 

GTCATGACTACCTCAGACTGGTAGAGAATGATATAGAAAAGCATATCAAAAGACTTCGTG 627 

TTGTGCATTTCTGTGAGAAGAAAGAGGACATCAAGTACACGTGTCGATTCATaAGGTAT 687 

Site D 

ATAAACCTGGGGATGAGGCTACCTCGACAIA^LEfiJGCGCTATCAAAGTGGAAAGATGCT 747 

GTTGTGCTGTGTTTGCAGATTGGCCAGAATCATGGTATATGGATACTAATGGTATCAAGA 807 



AGTATTCTCCAGATGAATGGGTGTCACATATAAAATrTTAATTAATGTAATAGAGAACAA 867 



ATAATAAGGTTGTAATATCATATAGAaATAACTAACAATTAATTACTAACTGTTATCTC 927 



TTTTTTAACTAACCAACTAACTATATACCTATTAATACATCGTAATTATAGTTCTTAACA 987 



TCTATTAATCATTAATTCGCTTCTTTAAT7T7TT47AAACTAACATTGTTAATT6AAAAG 1047 

HindB4R 

GGATAACATGTTACAGAATATAAATTATATATGGATTTTTTTAAAAAGGAAATACTTGAC 1107 

MOFFKKEILD 

TGGA GTATATATTTATCTCTTCATTATATA G CACG C GTGTTTTC CA ATTTTTC CA CA TC C 1167 
WSIYLSLH... YIARVFSNFSTS 
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Fig.16. 
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Fig. 17. 
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Fig. 18. 

MB03 TCAGGATCCCACATGAGCGAAAAATACATCG 
MB07 TCAAAGCTTATTACGATACAAACTTAACGGA 
MB09 TCAGTTAACATAAAAAGAACAACGCCCGGCAG 

MB14 TCAAGGGCCTCTATATAGTAATACCAATACTC 

MB15 TCAGTCGACTTACAAACAACTAGGAAATTGG 

MB1 6 TCAGAATTCTATGTACAGAGGTCTATTAGGC 

MB17 TCAAAGCTTGTATGAGGTGGGCAGCGTTCAC 

MB18 TCAGAATTCTTAATTATATTGTCGGCCGTGG 

MB19 TCAGCATGCATGATCCGTTAGCTTTGGGCTC 

MB22 TCAGAATTCGAAGCTCTAGAGTATCTTAGCG 

MB23 TCAAAGCTTTCCTGTATTATATGGGATGTGG 

MB24 TCAGAATTCATTGATGGATGAGATATACAGC 

MB25 TCAAAGCTTTCACAAAATCG 

MB26 TCAGAATTCCACGTATACGGCAGCAGCTTCC 

MB27 TCAAAG ClTrGTlXrr ACGTCCATTT TCA AGC 

MB32 TCAGTCGACATACCAATACTCAAGACTACGA 

MB33 TCACCATGGATTGCTATTGATTGAGTACTGTTC 

MB35 AGTACCTTATAATACGTAATAATCTGGTAG 

MB36 AATCTTCAGTCTGTTAACAATCAGTAGCAC 

MS37 TACAACGAAGCTAGGCCTCAACCATTTTAA 

MB38 TTTGATAGCGCATACGTATGTCGAGGTAGC 

MB39 AATAGGTATATAGTTAACTGGTTAGTTAAA 

501 ATCCCATGGCGCGCTTTGAGGATCCAAC 

502 TCACCCGGGTTACTGC7GGGATGCACACCAC 

505 ATCCCATGGACCAAAAGAGAACTGCAATGTTTC 

506 TCACCCGGGTTATGGTTTCTGAGAACAGATG 

507 AGCTCTACGTAGTTAACAAGCTTGTCGACCCATGGCCCGGGTACGTA 

508 AATTTACGTACCCGGGCCATGGGTCGACAAGCTTGTTAACTACGTAG 

520 GAAACCCAGCTGGGAATCATGCATGG 

521 GAAACACAAGTAGGAATATTAAGTATG 

522 GATCTCTACGGTTATGGGCAATTAAATGAC 

523 GACCTTCTAGGTCACGGGCAATTAAGCGAC 

524 ATGTATAGATTTCTACAGTAGAATCAGAGAATTAAG 
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Fig. 19. 



TATTGTTTACGAGCTATTAAACTGTTTATTAATGATCACATGCTTGA go 
YCLRAIKLFINDHMLDKIKS 
IVYELLNCLLMITCLIR*NL 
LFTSY*TVY**SHA**DKIY 

ATACTGCAGAATAGACTAGTATATGTGGAAATGTCATAGAAAGTT 120 
ILQNRLVYVEMS*KVKS**E 
YCRID *YMWKCHRKLKVNES 
TAE*TSICGNVIES*KLMRA 

CAAAAATATATAAGGTTGTATTCCATATTTGTTATTTTT^ 180 

QKYIRLYSIFVIFSVIVRKI 
KNI *GCIPYLLFFL* * L E K Y 
KIYKVVFHICYFFCNS * K N T 

CATTCGATGGTCTATCTACCAGATTATTATGTOT 240 
HSMVYLPDYYVL*GTFSHNK 
IRWSIYQIIMCYKVL-FLIIN 
FDGLS TRLLCVIRYFF S * * T 

CTAGAGTATGAGTAAGATAGTGTTTTTCAAAACATATAAATCT 300 
LEYE»DSVFQNI*I*N*WMR 
* SMS K IVFFKT YKS K I DG* D 

RV*VR*CFSKH INLKLMDEI 

TATACAGCTATTAATTTCGAAAATATATTTTAATCTGATAACTTT 360 
YTAINFENIF*SDNFKHGFL 
IQLLI SKIYFNLITLNMDF* 
YSY*FRKY ILI * ♦ L * TWIFD 

ATGGTGGTTTAACGTTTTAAAAAAAGATTTTGTTATTGTACT 420 
MVV*RFKKRFCYCSI**Y*K 
WWFNVLKKDFVIVVYDNIKR 
GGLTF*KKILLL*YMI ILKD 

ATGGATATAAAGAATTTGCTGACTGCATG7ACTATTTTTTACACT 480 

MDIKNLLTACTIFYITTLAT 
WI*RIC*LHVLFFTLLHWLR 
GY K EFADCMYYFL HY Y 1 GYG 

GCAGATATACCTACTCCGCCACCAACGGGTC^TGT 54 q 

AD IPTPPPTGHVTRENILIR 
QIYLLRHQRVM*QGRIS**E 
RYTYSATN GSCDKGEYLDKR 

GGCATAATCAATGTTGTAATCGGTGTCCACCTGGAGAATTTGCC^ 600 
G I I NVVIGVHLENLPRLDVM 
A* SML*SVSTWRICQG*M*W 
HNQCCNRCPPGEFAKVRCNG 

GTAACGATAACACAAAATGTGAACGCTGCCCACCTCATACATATAC 660 
VTITQMVNAAHLIHIPQSQL 
* R* HKM*TLPTSYIYHNPNY 
NDNTKCERCPPHTYTTIPII 
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Fig. 19(Cont). 

TTCTAATGGATGTCATCAATCTAGAAAATG^ 720 
F*WMSSM*KMPNRII**GKV 
SNGCHQCRKCPTGSFDKVKC 
LMDVIN VENAQPDHLIR*SV 

TACCGGAACACAGAACAGTAAATGTTCGT^^ 780 
YRNTEQ*MFVSSWLVLCY*F 
TGTQNSKCSCLPGWYCATDS 
PEHRTVNVRVFLVGIVLLIL 

TTCACAGACTGAAGATTGTTGAAATTGTGTACCAAAAAGGAGATGrc 840 
FTD*RLLKLCTKKEMSMRIL 
S Q T E D C * N CVP K RRC P C G Y F 
HRLKIV.EIVYQKGDVHADTL 

TGGTGGAATAGATGAACAAGGAAATCCTATTTGTAAATCGTGTTGTGTTG^ 900 
WWNR*TRKSYL*IVLCW*IL 
GGIDEQGNPICKSCCVGEYC 
VE*MNKE ILFVNRVVLVNIA 

CGACTACCTACGTAATTATAGACTTGATCCATTTC 960 
RLPT*L*T*SISSMQTI*M* 
DYLRNYRLDPFPPCKLSKCN 
TTYVIIDLIHFLHANYLNVI 

TTAATXATGATTTTGATGATAATCTTACCATACAT1ATATCGCXA 1020 
LIMILMIMLPYI ISLLG*CI 
*L*F***CYHTLYRYLVSVL 
NYDFDDNVTI H Y IATWLVYY 

ATTCAGTATGAAGACCTATTAATAATTACTTATCTTCT 1080 
IQYED L LIITYLLTILL*L* 
FSMKTY* *LLIF*RSCYNYN 
SV*RPINNYLSFDDLVIIII 

TATAAAAATACTTATGGCATAGTAACTCATAATTG CTGACGCGATAAATTCGTAATAATC 1140 
YKNTYGIVTHNC * R D K F V I I 
IKILMA**LIIADAINS**S 
*KYLWHSNS W LLTR*-IRNNL 

TGTTTTGTTCAAATTTTTATAAGGAATCTACAGGCATAAAAATAAAAATATA^ 1200 
CFVQIFIRNLQA*K*KYNL* 
VLFKFL*GIYRKKNKNIIYN" 
FCSNFYKESTGIKIKI * F I I 

TATACTCTTACAGCGCGCCATCATGAATAACAGCAGTGAATTGATTGCT 1260 
YTLTARHHE*QQ*IDC 
ILLQRAIMNNSSELIA 
YSYSAPS*ITAVN*LL 
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Fig. 20. 

ATATTTGGTATTACCGCATTAATTATATTGTCG^ 60 
IFGITALI ILSAVAIFCITY 
YLVLPH*LYCRPWQFS V LHI 
IWYYRINYIVGRGNFLYYIL 

TATATATATAATAAACGTTCACGTAAATACAAAACAGAGAAC^ 120 
YIYNKRSRKYKTENKV* IFD 
IYIINVHVNTKQRTKSRFLT 
YI**TFT*IQNREQSLDF*L 

TTACATAAATGTCTGGGATAGTAAAATCXATCATATTGAGCGGACCATCT 180 
LHK CLG* *NLSY*ADHLV* E 
YINVWDSKIYHIERTIWFRK 
T*MSGIVKSIILSGPSGLGK 

AGACAGCCATAGCCAAAAGACTATGGGAATATATTTGG A TT TC 240 
RQP*PKDYGNIFGFVVSH TT 
DSHSQKTMGIYLDLWCPIPL 
TAIAKRLWEYIWICGVPYH* 

AGATTTCCTCGTCCTATGGAACGAGAAGGTGTItaTTACCA 300 
RFPRPMEREGVDYHYVNREA 
D FLVLWNEKVLITI TLTERP 
ISSSYGTRRC*LPLR*QRGH 

ATCTGGAAGGGAAraGCCGCCGGAAACTTTCTAGAA 360 
IWKGIAAGN FLEHTEFLGNI 
SGRE*PPETF*NILSF*EIF 
LEGNSRRKLSRTY*VFRKYL 

TACGGAACTTCTAAAACAGCTGTGAATACAGCGGCTAT^ 420 
YGTSKTAVNTAAINNRICVM 
TELLKQL * IQRLLIIVFV*W 
RNF*N SCEYSGY**SYLCDG 

GATCTAAACATCGACGGTGTTAGAAGTCTTAAAAATACGTACCTAATGCCTTACTCGGTC 480 
DLNIDGVRSLKNTYLMPYSV 
I*TSTVLEVLKIRT*CLTRC 
SKHRRC* KS*KYVPNALLGV 

TATATAAGACCTACCTCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAG 540 
Y I RPTSLKMVETKLRCRNTE 
I *DLPLLKWLR?SFVVETLX 
YKTYLS*NG*DQASL*KH*S 
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Fig.20(Cont). 

GCTAACGATGAGATTCATCGTCGCGTGATATTGGCAAAAACGGATATGGATGAGGCCAAC 600 
ANDE I HRRVILAKTDMDEAN 
LTMRFIVA *YWQKRIWMRPT 
*R*DSSSRDIGKNGYG*GQR 

GAAGCAGGTCTATTCGACACTATTATCATTGAAGATGATGTGAAT^ 6 60 

EAGLFDTIIIEDDVNLAYSK 
KQVYS TLLSLKMM*I *HIVS 
SRSIRHYYH*R*CEFSI*"V 

TTAATTCAGATACTACAGGACCGTATTAGAATGTATTTTAACACTAATTAG^ 720 
LIQILQDRIRMYFNTN*RLK 
*FRYYRTVLECILTLIRDLR 
NSDTTGPY*NVF*H*LET* D 



ACTTAAAACTTGATAATTAATAATATAACTCGTTTTO 780 
T * NLI I NN I TRF YMWLF QRL 
LKT**LII*LVFICGYFNV* 
LKLDN**YNSFLYVAISTSN 

ATGTATTAGTTAAATATTAAAACTTACCACGTAAAA^ 840 
MY*LNIKTYHVKLKI*NDIS 
CIS*ILKLTT*NLKFKMIFH 
VLVKY*NLPRKT*NLK* YFI 

TTGACAGATAGATCACACATTATGAACTTTCAAGGACTT^ 900 
LTDRSHIMNFQGLVLTDNCK 
*QIDHTL*TFKDLC*LTIAK 
DR*ITHYELSRTCVN*QLQK 

AATCAATGGGTCGTTGGACCATTAATAGGAAAAGGTGGATTCGCT 960 

NQWVVGPLIGKGGFGS I YTT 
INGSLDH**£KVDSVVFILL 
SMGRWTIN RKRWIR* YLYY * 

AATGACAATAATTATGTAGTAAAAATAGAGCCCAAAGCTA 1020 
NDNNYVVKIEPKA 
MTIIM**K*SPKL 
*Q*LCSKNRAQSS 
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Fig. 21. 

ACCATCGAGGTAACCACCTCTCTGGAAGACAGCGTGAATAATGTACT 60 
TIEV TTSLEDSVNNVLMKRL 
PSR*PPLWKTA*IMYS*NVW 
HRGNHLSGRQRE *CTHETFG 

GAAACTATACGCCATATGTGGTCTGTCGTATATG ! 2 0 

ETIRHMWSVVYDHFDIVNGK 
KLYAICGLSYMII LIL*MVK 
NYTPYVVCRI*SF*YCEW*R 

GAATGCTGTTATGTGCATACGCATTTGTCTAATCAAAATCTTA^^ 180 
ECCYVHTHLSNQNLIPSTVK 
NAVMCIRICLI KILYRVL*K 
MLLCAYAFV*SKSYTEYCKN 

ACAAATTTGTACATGAAGACTATGGGATCATGCATTC^ 240 
TNLYMKTMGSCIQMD SMEAL 
QICT*RLWDHAFKWIPWKL* 
KFVHEDYGIMHSNGFHGSSR 

GAGTATCTTAGCGAACTGAAGGAATOVG^ 300 
EYLSELKESGGWSPRPEMQE 
S I IiAN*RNQVD GVPDQKCRN 
VS*RTEGIRWMESQTRNAGI 

TTTGAATATCCAGATGGAGTGGAAGACACTGAATC^ 360 
FEYPDGVEDTESIERLVEE F 
LNIQMEWKTL NQLRDW*RSS 
*ISRWSGRH*IN*EIGRGVL 

TTCAATAGATCAGAACTTCAGGCTGGTG^ 420 
FNRSELQAGESVKFGNS I N V- 
SIDQNFRLVNQSNLVILLML 
Q*IRTSGW»ISQIW*FY-C* 

AAACATACATCTGTTTCAGCTAAGCAACTAAGAA^ 480 
KHTSVS AKQLRTRIRQQLPL 
NI HLFQLSN*EHVYGSSFLY 
TYICFS*ATKNTY TAAASFI 

TACTCTC^TCTTTTACCAACACAAAGGGTGGATAm 540 
YSHLLPTQRVDICSLELIII 
TLIFYQHKGWIFVHWS * * * Y 
LSSFTNTKGGYLFIGVDNNT 

CACACAAAGTAATTGGATTCACGGTGGGTCATGACTACCTCAGACTGGTAGA 600 
HTK*LDSRWVMTTS D.W * R M I 
TQSNWIHGGS*LPQTGRE* V 
HKVIGFTVGHDYLRLVEND*! 

TAGAAAAGCATATCAAAAGACTTCGTGTTGTGCATTTCTGTGAGAAGAAAGAGGA 6 60 

•KSISKDFVLCISVRRKRTS 
RKAYQKTSCCAFL*EERGHQ " 
EKHIKRLRVVHFCEKKEDIK 
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Fig.21(Cont1). 

AGTAC^CGTGTCGATTCATCAAGGTATATAAACCTGGGGA 720 
STRVDSSRYINLGMRLPRHT 
VHVSIHQGI*TWG*GYLDIR 
YTCRFIKVYKPGDEATSTYV 

TGTGCGCTATCAAAGTGGAAAGATGCTGr^ 780 
CALSKWKDAVVLCLQIGQNH 
VRYQSGKMLLCCVCRLARIM 
CAIKVE RCCCAVFADWPESW 

GGTATATGGATACTAATGGTATCAAGAAGTATTCTC 840 
GIWILMV SRSILQMNGCHI* 
VYGY*WYQEVFSR*MGVTYK 
YMDTNGIKKYSPDEWVSHIK 

AATTTTAATIAATGTAATAGAGAACAAAIAAT^ 900 
NFN*CNREQIIRL*YHIDNN 
ILINVIENK**GCNII*TIT 
F*LM**RTNNKVVISYRQ*L 

TAACAATTAATTAGTAACTGTTATCTCT'ITIT1!AACT 960 
*QLISNCYLFFN*PTNYIPI 
NN*LVTVI SFLTNQLTI YLL 
TIN**LLSLF*LTN*LYTY* 

AATACATCGTAATTATAGTTCTTAACATCTATTAATCATTftA 1020 
NTS*L*FLTSINH*FASLIF 
IHRNYSS*HLLIINSLL*FF 
YIVIIV LNIY*SLIRFFNFL 

TATAAACTAACATTGTTAATTGAAAAGGGAlAAa 1080 
YKLTLLIEKG*HVTEYKLYM 
IN* HC*LKRDNMLQNINYIW 
•TNIVN. *KGITCYRI*IIYG 

GATTTTTTTAAAAAGGAAATACTTGACTGGAGXAT^^ 1140 
DFFKKEILDWSIYLSLHYIA 
IFLKRKYLTGVYIYLFII*H 
FF*KGNT*LEYIFISSLYST 



SUBSTITUTE SHEET 



WO 92/16636 PCT/GB92/00424 



Fig.21(Cont2). 



CGCGTGTTTTCCAATTTTTCOUaTCCCATAT^ 120 o 
RVFSNFSTSHIIQDYNL VRT 
ACFPIFPHPI'YRIIISFEH 
RVFQFFHIPYNTGL*SRSNI 

^yGAGAAAGTGGATAAAACAAXA G ir GA I 1 1 1 11 A111TA GGTTGCCAAATTIATTCCAT 1260 
TfEKVDKTIVDFLSRL PNLFH 
TRKWIKQ*LIFYLGCQIYSI 
R£5G # NNS*FFI*VAKFZPy 

ATTTTAGAATATGGGGAAAATATTCTACATATTTATTCTATGGATGATGCT^ 1320 

I LEYGENILHIYSMD OANTN 
F * N MG KI F Y I F I LWMML I R I 

FRIWGKYSTYLFYG*C*YEY 

ATTATAAT11 1 rmXJTAGATAGAGTATTAAATATTAATAAGWU^TCATTTATACAC 1380 

II IFFLDRVLNINKNGSFIH 
L *FFF*IEY«ILIRTGHLv T 

YNFFSR *SIKY **ERVIYTQ 

N LRLSSSINIKEYVYQLVNN 
I S GYH HPLI*KKMYIN*LIM 
SQVIIIH*YKRICISIS*«* 

GAXGATCCAGAXAATAGGAX&AGACXAATGCTTGAAAATGGACGTAGA i S00 
D H PDNRIRLMLEN GRRTRHF 
I IQI.IG *D*CLKMDVEQDII 
S SR* * D K T N A * KWT*NKTFF 
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Fig. 22. 

/_ 1 Vaccinia virus Vaccinia virus 

| a J H6 promoter 7.5K promoter 

Vaccinia virus DNA HPV16 E6-7 \ / HPV18 E6-7 vaccinia virus DNA 



MB17 1413 bp «B15 MB32 ^ m ^ 

The sequences present at the 5' end of PCR primers M15 and MB32 
overlap by 6 nucleotides 
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Fig. 24. 



(a) 
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The sequences present at the 5' end of PCR primers M15 and MB32 
overlap by 6 nucleotides 



(b) 



2kb- | 
1kb" - « 



Recombinant vaccinia virus plaque numbers 

^ 14 15 ^20 <T 




ff 



* * • • • 



PCR primers S6/MB15 



PCR primers S2^MB32 



SUBSTITUTE SHEET 



WO 92/16636 



PCT/GB92/00424 



3 % 



Fig. 25. 
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Fig. 26. 
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